Dynamics underlying Box-office: Movie Competition on Recommender Systems 
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We introduce a simple model to study movie competition in the recommender systems. Movies of 
heterogeneous quality compete against each other through viewers' reviews and generate interesting 
dynamics of box-office. By assuming mean-field interactions between the competing movies, we 
show that run-away effect of popularity spreading is triggered by defeating the average review score, 
leading to hits in box-office. The average review score thus characterizes the critical movie quality 
necessary for transition from box-office bombs to blockbusters. The major factors affecting the 
critical review score are examined. By iterating the mean-field dynamical equations, we obtain 
qualitative agreements with simulations and real systems in the dynamical forms of box-office, 
revealing the significant role of competition in understanding box-office dynamics. 
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I. INTRODUCTION 

Dynamics underlying movie box-office has been an im- 
portant issue for both academic and commercial interest 
in the past decades of years [l|-[3[ ■ As compared to the 
traditional advertising campaign, recommender systems 
nowadays provide reviews and scores which constitute a 
platform for movies to compete among each other di- 
rectly. Due to the popularity of the Internet, such com- 
petitions play an increasingly significant role in driving 
movie box-office. Despite their importance, only single 
movie dynamics is considered in conventional approaches 
[J, |5j, leaving competitions through recommender re- 
views unattained. Such movie interactions are in par- 
ticular interesting to understand the physics of movie 
competitions in influencing box-office dynamics. 

Box-office dynamics has been studied using different 
approaches ranging from statistics to neural networks. 
In early approaches, potential viewers consider the past 
total success of a movie [1] , or the decision of its prede- 
cessor [2| , to decide whether to watch the movie. Though 
multiple movies are considered in these approaches, com- 
petition though modern recommender systems is not ad- 
dressed. Movie competitions are considered in Q but dy- 
namics of the competing movies is not examined. On the 
other hand, single movie dynamics is studied by differen- 
tial equations [J] and the spread of movie awareness by 
automata or percolation |5|. To understand and predict 
the box-office dynamics, empirical studies are conducted 
a, Ml and statistics based forecast Q and neural networks 
9, [l0| are employed. Some of these approaches consider 
individual factors such as nations, genre and star values, 
and may overlook the importance of competitions among 
movies. 

In this paper, we introduce a model in which movies 
compete with each other through reviews posted on a 
recommender system. Viewers post their reviews and 
drive other potential viewers to watch the movie, which 
in turn produce new reviews driving another group of 



viewers. The present reviews driven mechanism spread 
movie popularity and generate interesting dynamics in 
movie box-office. Here we consider movie reviews as both 
indicators as well as influencers of box-office, as suggested 
by empirical data in [6] . To capture only the essential el- 
ements, movies in our model are only differentiated by 
their quality and time of introduction. As different from 
approaches which consider heterogeneous viewers, we as- 
sume that potential movie goers are homogeneous and 
are only driven to watch movies either by movie reviews 
or movie freshness. All these ingredients constitute a 
simple model which facilitate the illustration of physical 
phenomenon behind movie competitions. 

We will show that, by mean- field approximation, the 
average review score characterizes the critical movie qual- 
ity necessary for booms in movie box-office, and corre- 
sponds to a transition from box-office bombs to block- 
busters. The physical reason behind the booms is the 
success of the movies in spreading popularity through the 
recommender systems, creating cascades and dynamical 
hits after their introduction. Though we are not able 
to provide an accurate estimate of the average review 
score, we show that the analytical results have quantita- 
tive agreements with simulations and real data, suggest- 
ing the present model in describing the fundamentals of 
movie competitions. Finally, we generalize the mean- 
field approximation to analyze the competitions of two 
good movies and show that box-office dynamics of the 
competing movies are anti-correlatcd. 

The paper is organized as follows. We describe the for- 
mulation of the model in Section |TTJ In Section IIII1 we 
employ the mean-field approximation for movie interac- 
tion and discuss the dependence of gross box-office and 
box-office dynamics on the quality of movies. The com- 
petition between two good movies is discussed in Section 
IIII CI We finally compare our approximation with sim- 
ulation and empirical results in Section llVl Conclusions 
are given in Section [V] 



II. MODEL FORMULATION 

We consider a community of N agents aiming to watch 
movies of high quality. At each time step, a fraction p 
of the agents are chosen to be the potential viewers, out 
of which a fraction ui, which we call the trendiness, in- 
tend to watch a new movie. A new movie is introduced 
at each step. Each movie a, introduced at time t a , is 
characterized by quality Q a randomly drawn from the 
distribution p(Q a ). Starting from the second step af- 
ter their introduction, movies are considered to be old 
and are recommended to agents by a centralized recom- 
mender system, which is in the form of a review list as 
shown in Table HI The remaining potential viewers (who 
have no intention to watch the new movie) select an old 
movie from the list. Thus, a high uj also corresponds to 
a small dependence of movie goers on the reviews. For 
a particular movie a, we denote the number of viewers, 
i.e. the box-office revenues, at time t to be k a (t). 

In reality, potential viewers of a new movie sense 
the quality of movie from advertisements. We thus as- 
sume that they watch the new movie a with probability 
7i"a oc Q s a , such that the "opening" k a (t a ) oc NptuQ s a . A 
suitable proportionality constant would be Q^Lv where 
Q < Qmax as restricted by p{Q). When s — 0, potential 
viewers watch the new movie regardless of its quality. We 
set s > when agents have a good sense of movie quali- 
ties to make potential viewers inclined to movies of high 
quality. Thus, we call s the quality sensitivity. 

On the other hand, potential viewers of old movies de- 
cide to watch an old movie by gathering information from 
websites of movie reviews, movie magazines or word-of- 
mouth recommendations from peer viewers. We express 
this kind of centralized recommendations by a list of 
movie reviews as shown in Table HI Movie popularity 
thus spread among the agents through the list. For sim- 
plicity, the reviews are expressed in the form of scalar 
scores. At a particular time, we denote the total num- 
ber of reviews on the list as L and reviews are labeled 
by r = 1, . . . , L. The movie and its corresponding score 
on the Z-th reviews are denoted respectively as m r and 
u r . For a potential viewer i, the probability to choose a 
movie a from the list is 



Movie ID 


Movie Score 


a 


5 


7 


3 


P 


4 


7 


2 


a 


5 



TABLE I: Examples of movie reviews on the recommender 

system. 



for the recommendation of objects where in the long run 
freshness is important. 

As viewers may provide generous or harsh critics, we 
assume that they post their reviews with probability 
T] a oc Q 9 a . When g = 0, they review the movie regardless 
of its quality. The volume of reviews is thus an indicator 
of the box-office. When g > 0, they tend to give good 
comments and review movie of high quality. By contrast, 
g < represents an opposite phenomenon as reviewers 
tend to give bad comments, making bad movies more 
popular (by number of reviews) than good movies in the 
recommender system. We thus interpret g to be the gen- 
erosity of the viewers. In this paper, we define popularity 
of a movie to be the fraction of reviews on the movie 
(over all current reviews). 

To further simplify the model, we combine Eq. ((T|) and 
the generosity, and assume that the score u a of a movie 
is simply its quality Q a , which implies that all agents in 
the model are rational and homogeneous. Note that in 
this case u a is continuous and can be considered as the 
average estimated quality by the reviewers. Eq. (JXJ) thus 
becomes 



2-~ir=l\ a i,m r )Wm r ®mr, 
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(2) 



When g = — 1, viewers select a movie merely by popular- 
ity on the list, regardless of quality ,which is suggested 
by the results of empirical study in [6| . 
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where a ia = 1 if viewer i has already watched movie 
a and otherwise 0. Eq. (JXJ) characterizes the competi- 
tion of on-list movies. Only reviews from the previous 
step are shown on the list, i.e. reviews before the previ- 
ous step are deleted. The up-to-date reviews lead to the 
natural evolution of box-office. It can be shown that by 
this clearing mechanism, the total number of viewer for 
a movie is independent of its time of introduction (given 
that the observed time is longer than the lifespan of the 
movie) , as different from the first- mover effect in citation 
of scientific papers [11]. This is of particular importance 



III. BOX OFFICE DYNAMICS - THE 
MEAN-FIELD APPROXIMATION 

We start to investigate the box-office dynamics of 
movies by mean-field approximation. As mentioned, 
movies compete with each other by interaction through 
the review list, as potential viewers select one of the on- 
list movies by comparing their quality and popularity. In 
the mean-field approximation, we assume a mean inter- 
action between movies and denote (u) to be the average 
score over reviews of on-list movies, which have been in- 
troduced for at least two steps on the recommender sys- 
tem. In other words, at time t with m r denoting the r-th 



movie on the list, (u) is given by 

/ \ ._ £r=l W - tm r + 2)l 
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movie quality, (u) = (Q} . We then approximate Eq. (|2]) 
for movie a introduced at time t a by 



(3) 



where the heaviside function 6{y) = 1 for y > and 
otherwise 0. As we have set the review score to be the 
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for t = t a + 1] 
for £ > t a + 1, 



(4) 



where movie 7 corresponds to the movie being introduced 
at time t 1 =4—1. Readers may argue that the average 
of i^i. a {t) instead of Q a would be a better approximation 
than Eq. (QJ. Nevertheless, we keep the present form of 
approximation as it greatly simplifies the analysis, while 
capturing the essential features and facilitating physical 
interpretations. Justification is given in Section IIVI and 
the discrepancy between the present approach and the 
simulation results is described. 

From Eq. (U), we write down the iterative equations 
which describe the box-office dynamics k a (t). To sim- 



J 



plify the notation, we define f a (t) = k a (t + t a )/Np, cor- 
responding to its popularity. Thus, popularity of an on- 
list movie is proportional to its box-office at every step. 
The initial popularity / a (0) is proportional to ujQ s a . We 
further approximate (Q 9+1 ) by (Q)o +1 , which turns out 
to be a good approximation as the review list is usually 
dominated by several good movies which have similar 
quality. The expression for subsequent f a {t) is given by 



(l-w)[l-p/a(0)] 



fa(t) = I 
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for t = t a + 1, 

(5) 
for £ > t a + 1. 



These equations can be iterated numerically to generate 
the dynamics of box-office. 



A. Booms in gross box-office 

To obtain the relation between gross box-office and 
movie quality, we adopt again the mean-field approxi- 
mation for interactions with new movies, and approx- 



imate / 7 (0)Q 7 +1 by u{Q 



l+s+9> 



p/Qn 



which is the 

value averaged over the distribution p(Q) in the relation 
/ 7 (0) oc wQ 7 , subject to the proportionality constant 
Qmax- The quantity (Q) is dependent on the dynamics 
of box-office and competition between on-list movies, and 
hence is difficult to compute. Nevertheless, we approxi- 



mate (<2) by 
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where Q s+9 is proportional to the probability for the 
movie to appear on the recommender list after its in- 
troduction. We do not claim that the above expression 
is a good approximation of (Q) , but we will see that a 
rough estimate of (Q) is sufficient to generate the fea- 
tures of the model. We then iterate Eq. ([5]) numerically 
to obtain ^2 t f a {t), which is proportional to the gross 
box-office K a . 

The rescaled gross box-office K a /N is shown in Fig. Q] 
as a function of Q a . The gross box-office of a movie shows 
a drastic rise, i.e. a boom, when its quality is beyond the 
average review score (Q) Q of existing competitors. (Q) Q 




Q7—QI 
/„(i)>/„(0) 

§^/„(2)>./„(l) 

0.2 0.4 0.6 0.8 

CO 



e„/<G>„ 




B-EJ 1.4 

-+1.3 

G-G1.2 

X-K 1.1 

A-A 1 
>«0.9 

0.8 
H — H0.7 

0.5 
x-^<0.1 



FIG. 1: (Color online) The gross box-office of a movie a as a 
function of Q a - Parameters: p — 0.05, s — 1 and <? = 1. (Q) a 
are obtained with p(Q) ~ Q~ 2 with Q ma x = 50. Inset: The 
same graph in magnified vertical scale. 



thus characterizes the critical quality of movies to become 
blockbusters. We note that different forms of p(Q) alter 
(Q) , but not the general picture of booms. Remarkably, 
the gross box-office does not show a large dependence on 
u>, the ratio of agents who intend to watch a new movie. 
However, we will see that the dynamics of f a does show 
a dependence on u>. 

To get a better understanding of Fig. [TJ we obtain 
an explicit asymptotic form of the gross box-office by 
considering Eq. ([5]) in the large t limit. In this case, 
0(f a (t)) <§C 1 and the terms with / Q -1 (£ — 1) becomes 
dominant in the denominator. We thus ignores terms of 
O(l) and rewrite Eq. ([5]) as 



fa(t) = 



(1 



UJ 



l-pEt= /a(t / )l/ a (*-l)Q|+ 1 



/ 7 (0)Qr i + (l-c)(Q)g 



s+i 



(7) 



For movies of low quality, J^t fa(t) ls negligible and f a (t) 
follows an exponential decay. For movies of high quality, 
their popularity on the recommender system is long last- 
ing, and f a (t) ~ fa(t— 1) when t is large. The gross 
box-office of the movie is thus given by 
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(9) 



where the last line is valid only when the second 
term dominates in the numerator, i.e. the average 
on-list movie quality is high such that (Q)^ +1 ^> 
(Q s+9+1 ) P /Qt- nax - We thus see that when on-list movies 
are of high quality, the gross box-office is only weakly 
dependent on w through (Q) - As Q a — > oo, K a /N — > 1. 
From Fig. [T] and Eq. (JSJ), we can estimate the critical 
quality Q c for box-office boom to be 



(Q)o, 



(10) 



FIG. 2: (Color online) } a {i) as a function of t for movies of 
various quality Q a . Parameters: p = 0.05, w — 0.3, s = 1 and 
g — 1. (Q)o are obtained with p(Q) ~ Q~ 2 with Q max = 50. 
Inset: The phase diagram for hits in box-office. The critical 
quality Qc and Q c as shown respectively by the solid and 
dashed lines, as a function of u>. 



which implies blockbusters appear at lower quality given 
that competitors are bad movies. 

To obtain the degree distribution of movies, we exam- 
ine Y^t fa{t) for movies with low Q a . As shown in the 
inset of Fig. [3 K a /N rj pf a (0) for movies with very low 
quality, due to the fast decay of their popularity after 
introduction. Assuming movie quality is distributed as 
p(Q) ~ Q 7 7 the distribution of gross box-office K is 
given by 



P(K) - K' 



(11) 



which is valid for movies of low quality. P{K) shows a 
long tail for large K, due to the boom in gross box-office 
for Q > Q c . 



B. Hits in box-office dynamics 

To examine the box-office dynamics, we show in Fig. [5] 
how f a (t) evolves with time. For movies of high quality, 
their popularity increases after their introduction, corre- 
sponding to an immediate hit in box-office (see for in- 
stance the line with Q a /(Q)o = 1-4). We thus consider 
/a(l) > /a(0) which implies Q a greater than a critical 
quality Q c , given by the positive root of equation 



w(l — puj +p) 
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(12) 



The corresponding Q c /{Q)o f° r different values of u are 
shown by the solid line in the inset of Fig. [5J which im- 



plies that immediate hit occurs for movies with quality 
in the shaded region. Though explicit solution for Q c is 
difficult to obtain, we see from Eq. (fT"2|) that when s = 0, 

only the rescaled quantity Q c /(Q)o is relevant. Thus, 
defeating existing movies on the reviews list is a major 
physical origin for an immediate hit in the box-office dy- 
namics. Moreover, a hit is more likely to occur at time 
with bad movies in the list, i.e with low (Q) - 

The solution of Q c exists only when the trendiness is 
smaller than some threshold uj c given by the root of the 
following equation 



Epul + [5(1 + p) + l]w c -1 = 0, 



(13) 



when s = g = 1, where 5 = 3y/3{Q)^/2Q max . Beyond 
the threshold, an immediate hit does not occur regardless 
of movie quality. It implies that this phenomenon is less 
likely to occur when viewers have high intention to watch 
new movies. From Eq. (|13p . we see that co c increases with 
decreasing (Q) , implying that an immediate hits occur 
at high trendiness when on-list movies are bad. 

Another interesting behavior is observed in the range 



,(11) 



,(i) 



of Qc < Q a < Qc as shown in striped region of 
the inset of Fig. O This range of quality corresponds 
to movies with quality above {Q) a , where their popu- 
larity shows an immediate drop after introduction and 
rises afterwards (see for instance the lines in Fig. [2] with 
Qa/(Q)o = 1.1 and 1.2). Physically, at the first step after 
introduction the immediate drop in popularity is induced 
by the high trendiness, together with competitions with 
existing good movies. At the second step, the popularity 
rises as a new movie which is usually of lower quality is 
reviewed by the viewers. Such behaviors are observed in 
simulations and real data of Netflix in Fig. [6l a website 
for movie recommendation. 

These hits are formed by the successful spreading in 
the recommender systems, as similar to other forms of 
cascade, such as information cascade [2|. Early viewers 
post their positive reviews and attract more viewers, re- 
inforcing the "good gets richer" effect [12J]. After the hits, 
many users watched the movie and its on-list popularity 
ultimately drops. 

We expect such hits in box-office, either at the first or 
second step, to be the main cause for the gross box-office 
boom in Fig.[TJ Despite a small change in Q a /(Q) from 
0.9 to 1.1, a comparison of f a (i) in Fig. [3] reveals that the 
hits result in large J^t /o(i)i which is consistent with the 
result shown in Fig. [TJ We thus expect that Q c for boom 
coincide with Q c ' for hits. These results show that the 
outcomes of the competitions with on-list movies are cru- 
cial in box-office dynamics and gross box-office. Defeat- 
ing existing movies in reviews creates hits and lead to 
boom, while losing the competition suppresses popular- 
ity and results in small box-office. 




FIG. 3: (Color online) Upper penal: f a (t) and fp(t) as a 
function of t for movie a and /3 respectively introduced at time 
tot and tot + 1 with quality Q a — Qp = 1A{Q) . Parameters: 
p = 0.05, id = 0.4, s = 1 and g — 1. (Q) are obtained with 
p{Q) ~ Q~ 2 with Q m ax = 50. Inset: The same figure with 
Q a — 1.5(Q) and Qp — 1A{Q) . Lower penal: detrended 
f a (t) and fp(t), by subtraction of moving average. 



C. Competition between good movies 

Slight modifications of Eq. ([5]) allow us to write down 
the box-office dynamics of two movies introduced one 
after another, and enable us to study more directly the 
competition. We consider movie /3 introduced after movie 
a, i.e. tp = t a + 1. As the derivation is similar to the 
that of Eq. (O, the coupled equations for f a (t) and fp(t) 
are given in the appendix. 

We first examine the competition between two good 
movies with equal quality. As shown in the upper pe- 
nal of Fig. H with Q a = Q p = 1.4(Q) , both f a (t) 
and fp(t) show small oscillations intervening with each 
other. A simple detrending, e.g. subtraction of moving 
average (lower penal of Fig. [3]), reveals that their box 
offices are anti-correlated, implying that they are com- 
peting for viewers through the review list. As compared 
to the case with a single movie in the mean-field approx- 
imation, both f a (t) and fp(t) show a small or no hit 
shortly after its introduction, but their popularities last 
longer as they have higher tails. Remarkably, we found 
that the gross box-office of the competing movies show 
only a small drop when compared to its counterpart in 
the single movie case, as the tails are long for both f a (t) 
and fp(t). Such behaviors are not observed in finite size 
systems, as small f a (t) and fp(t) in the tail is not suffi- 
cient to spread popularity, when k a (t),kp{t) < 1. 

For movies of identical quality, we can simplify 
Eq. (|A1[) to get a simple relation between f a (t) and fp (t) 
when t > t a + 2, as given by 






U(t-i) 



1) 



(14) 
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The relation shows that during the competition, popu- 




0.5 



FIG. 4: The average quality and the number of movies on 
the recommender list in simulations, as a function of g. Pa- 
rameters: p — 0.05, id = 0.5, s — 1 and g — 1. Quality of 
movies is drawn from p(Q) ~ Q -2 with Q m ax = 50. Inset: 
the fraction of relevant reviews as a function of g. 



larity is influenced by two factors: (1) the number of 
accumulated popularity ^2 t 7—of(f), and (2) the popu- 
larity f(t— 1) in the previous step. Physically, it implies 
that when two movies of equal quality are competing, 
the one which accumulated less box-office get a slight 
bias in popularity, as a higher portion of agents did not 
watch the movie. This increases the popularity of the 
biased movie at this step and has a reinforcing effect for 
its popularity in the next step. However, such effect does 
not last forever as the number of accumulated popularity 
ultimately increases and an opposite trend starts. This 
phenomenon causes a balancing effect on the popular- 
ity of the competing movies, and results in intervening 
between the movies' popularity, leading to similar gross 
box-office for the two movies. Such intervening is more 
prominent when the two movies differ in their quality, as 
shown in the inset of Fig. [3l 



IV. COMPARISON WITH SIMULATION AND 
EMPIRICAL RESULTS 

Finally, we compare our theoretical mean-field approx- 
imation with simulation and empirical results, and look 
at details which are not captured by the mean-field ap- 
proach. 

In Fig. 21 we show the average quality of movies and 
the number of movies on the review list in simulations. 
As expected, when g increases, the quality of movies on 
the list increases. However, the number of on-list movies 
decreases: choice becomes more limited when reviewers 
tend to review good movies. It corresponds to a trade- 
off of quality with diversity, which is found in other rec- 
ommendation systems [13J . Note that (Q) corresponds 
to the average over all on-list movies, which is different 
from (Q) in the mean-field approximation. Neverthe- 
less, (Q) shows the same trend as (Q). 



■ ■ approximate (Q) 
- - semi-empricial (Q) 
- fitting (G> 
simulation * . 




FIG. 5: (Color online) The popularity f(t) from simulations 
(symbols) and mean-field approximation with semi-empirical 
{Q)o (dashed lines). Parameters: p = 0.05, cj = 0.5, s = 1 
and g = I . Quality of movies is drawn from p(Q) ~ Q~ 2 with 
Qmax = 50. Inset: gross box-office as a function of movie 
quality. The corresponding values of {Q)o for analytic, semi- 
empirical and fitting are 33.7, 27.9 and 15. 



We further define the fraction of on-list reviews from 
movies which have not been watched by a potential 
viewer i at time t to be <f>i(t), given by 



■•(*) 



E 

r=0 



9(t - t„ lr + 2)oi 



(15) 



The average value of <f> over users and time is shown in the 
inset of Fig. 21 As g increases, (<j>) decreases which show 
that when diversity decreases, viewers are more likely 
to find a watched movie on the list. This makes the 
recommender system less effective. 

In the inset of Fig. [5J we then compare the K — Q 
relation between the mean-field results ([9]) and the simu- 
lation results. Despite the fit with the approximate (Q) 
(from Eq. ©) does not agree with the simulation results, 
it does capture the features of the drastic increase in K a 
when Q a is beyond a critical quality. We thus incorpo- 
rate the simulated values of (Q) in Eq. ©, which is 
shown by the green dashed line as semi-empirical predic- 
tion. Though capturing the trend of the drastic rise, the 
prediction is below most of the data points. We have ex- 
amined the origin of the discrepancy by comparing 7Tj jQ ,(£) 
in simulations and the predicted values in the mean-field 
approximation, which shows that averaging ni >a (t) in- 
stead of only its denominator (as in the present approach) 
would yield a better approximation. Nevertheless, incor- 
porating a smaller (Q) than the simulated value gives 
the red solid line in the inset of Fig. [5] which agrees well 
with simulations. It implies that the present approach 
captures the major features of the model, given an es- 
timate of effective (Q) - As a result, competition with 
the effective (Q} a is thus the major factor in driving box- 
office dynamics. 

We then compare the simulated with the predicted /(£) 
incorporated with semi-empirical values of (Q) Q in Fig.[S] 
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FIG. 6: (Color online) The popularity f(t) from 5000 movies 
on Netflix in the period from Oct 1998 to Dec 2005, with K, 
the total number of viewers as indicated in the range. The 
number of active users is 4.8 x 10 5 in this period. The symbol 
x indicate the average weekly "opening" of movies. 



We have improved the mean-field prediction by incorpo- 
rating the simulated (4>) as a coefficient for (Q 9+l ) in 
Eq. Q. The simulated f(t)'s are obtained by averaging 
box-office of movies within a small range centered at the 
indicated Q a . As can been seen, the predicted f(t) fits 
well for large value of Q, and the simulated f(t) show 
also the hits starting from the second step. We remind 
readers that physical origins of such hits are discussed in 
Section llllBI These simulation results also show that the 
hits in dynamics is the reason for gross box-office boom, 
as predicted from our analysis. 

Finally we show the popularity f(t) of movies as ob- 
tained from Netflix. The number of reviewers N r on Net- 
flix increases with time due to the its increasing popular- 
ity, we thus put f a (t) = k a {t)/N r (t) which corresponds 
to the share of reviews on movie a at time t. As the 
intrinsic quality of movies are not known, we suggest to 
distinguish movie quality by the number of total viewers 
K, and plot in Fig.|6]the average f a (t) from movie a with 
K a falling in a particular range. From Fig. [HI the two up- 
per curves show a high "opening" and an immediate drop 
in the second weeks, while a prominent hit occurs after- 
wards. They correspond to /(£) from movies with large 
K, i.e. movies of high quality. For movies with small K, 
the inset of Fig. [B] shows in expanded vertical scale that 
hit is only observed for movies with 10 3 < K < 10 4 , but 
not for movies in the groups of lower K. These empirical 
results show qualitative agreements with the results ob- 
tained in the present model, which suggest the validity 
of the present description for the fundamental box-office 
dynamics. 



ing the mean-field approximation for movie interaction, 
we show that, for movies defeating the average review 
score, their popularity spreads through the review sys- 
tems as similar to other forms of cascade. Popularity 
hits are formed either at the first or second steps after 
the introduction of these movies, and result in booms in 
gross box-office. The average review score thus character- 
izes the critical quality of movies to become blockbusters. 
Such average score represents the average quality of peer 
competitors which implies hits are more likely to occur 
when competitors arc bad movies. On the other hand, 
less generous reviewers and low intention for watching 
new movies create more prominent hits. These results 
show that the outcomes of the competitions with on- 
list movies are crucial in box-office dynamics and gross 
box-office. Defeating existing movies in reviews creates 
hits and lead to boom, while losing the competition sup- 
presses popularity and results in small box-office. 

Generalizing the mean-field approximation allow us to 
analyze and show that the box offices of two compet- 
ing good movies are anti-correlated, which produce long 
lasting awareness as compared to the case of single good 
movies. The model reveals the significant role of movie 
competition in understanding box-office dynamics. 

We remark that the model can be modified to study 
dynamics of movie viewers. For example, more reviews 
can be shown on the list by storing reviews of older than 
one day, according to popularity movie quality or a suit- 
able decay function. Agents and movies of heterogeneous 
taste and attribute can be modelled, while review scores 
are given according to the corresponding overlap. Such 
modifications may reveal more fundamental aspects and 
interesting dynamics driving movie box-office. 
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Appendix A: Dynamical equations of correlated 
popularity 



V. CONCLUSION 

We studied the competition of movies through reviews 
in a simple model of recommender system. By adopt- 



In this appendix, we write down the dynamical equa- 
tions for f a (t) and fp (t) discussed in Section MI CI re- 
garding the competition between movie a and /3 intro- 
duced one after the other. The coupled equations are 
given by 
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for t = t a + 1, 

for t = t a + 2. 
(Al) 



for t >t a + 2. 

for i = t a + 2, 

(A2) 
for t > t a + 2. 
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